Utilization of disk buffer for background replication processes

ABSTRACT

A method for replicating data from a first volume to a second volume includes receiving a first data request comprising a request for a first portion of data, wherein the first portion is part of a first volume. The first portion of data is read, and so is at least a second portion of data in addition to the first portion of data requested in the first data request. In response to determining that the second portion of data should be replicated to the second volume, the second portion of data is written to the second volume.

FIELD OF THE INVENTION

The present invention relates generally to storage devices and, more particularly, to utilizing a disk buffer to improve the performance of background replication processes.

BACKGROUND

Presently, hard disk drives are one of the more commonly used forms of electronic data storage devices. Hard disk drives contain one or more magnetic disks (“platters”) which serve as the storage media. Each platter includes a plurality of concentric circular tracks on its surfaces, and each platter is divided into geometrical sectors. The intersections of the geometrical sectors and the concentric circular tracks define data sectors, which typically consist of 512 bytes of data storage space each. Magnetic heads move into position over the appropriate tracks (i.e., seeking) and, as the platters rotate beneath, the magnetic heads write data to one or more sectors on those tracks as a series of magnetic polarity transitions. The magnetic heads can also read stored data by detecting these magnetic polarity transitions (or the absence thereof).

Hard disk drives typically include a disk buffer, which is an embedded memory that helps increase performance. For example, when a request is received for one or more sectors of data, there is a reasonable likelihood that subsequent requests will be made for data in the sectors located before and after the requested data on the same track. Storing that data in the disk buffer therefore helps increase performance, as the request and subsequent requests for the data are fulfilled from the disk buffer rather than requiring additional seeking and reads from the platters.

A storage controller manages multiple hard disk drives and can present the hard disk drives to a computer as one or more logical volumes. For example, a storage controller can receive application requests to write data to a particular volume and, based on the geometry of the platters (e.g., track density, sector size, etc.), the storage controller can instruct the appropriate hard disk drive to write the data in the appropriate sectors.

Storage controllers typically have the ability to replicate volumes in the background while still fulfilling input and output (I/O) requests made by applications. A typical replication process involves sequentially reading sectors from a first volume (i.e., on a first hard disk drive) and writing those sectors to a second volume (i.e., on a second hard disk drive). Such replication processes can involve considerable head movement between different tracks of the hard disk drive, resulting in seek delays that can affect the performance of both the replication process and the fulfillment of application I/O requests.

SUMMARY

A method for replicating data from a first volume to a second volume includes receiving a first data request comprising a request for a first portion of data, wherein the first portion is part of a first volume. The first portion of data is read, and so is at least a second portion of data in addition to the first portion of data requested in the first data request. In response to determining that the second portion of data should be replicated to the second volume, the second portion of data is written to the second volume.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS

FIG. 1 is a block and data flow diagram of a hard disk drive environment in accordance with an exemplary embodiment of the present invention.

FIG. 2 is a flowchart illustrating operational steps of control logic for performing an accelerated background replication process in accordance with an exemplary embodiment of the present invention.

FIG. 3 is a flowchart illustrating operational steps of control logic for performing an accelerated background replication process in accordance with another exemplary embodiment of the present invention.

FIG. 4 is a diagram of a platter illustrating a hypothetical scenario in which an accelerated background replication process is performed in accordance with the control logic of FIG. 2.

FIG. 5 is a diagram of a platter illustrating a hypothetical scenario in which an accelerated background replication process is performed in accordance with the control logic of FIG. 3.

DETAILED DESCRIPTION

Existing methods for a background replication process can interfere with application input/output (I/O) requests due to random head movement caused by READ requests generated by interleaved background copying and application I/O. Embodiments of the present invention recognize that under heavy load application I/O load scenarios, background copy rate is typically throttled down to reduce interfering with application I/O performance. This causes the background copy process to be prolonged and hence leads to unavailability of an in-sync replica for longer periods of time. Embodiments of the present invention exploit application I/O without introducing expensive overhead and prevent the need to throttle down background copy rates during heavy application I/O load conditions.

Embodiments of the present invention disclose a method, computer program product, and computer system for accelerating a background replication process on storage devices during application input/output requests. More specifically, when a background replication process finds itself in competition with application I/O, embodiments of the present invention allow the background replication process to utilize the reads being performed by the application I/O and avoid duplicitous reads and time spent seeking data on discrete tracks. For example, embodiments of the present invention may fulfill READ requests issued by the background copy process from the disk buffer, thereby eliminating the need to read data from actual disk platters, on which the requested data is located on, and any corresponding disk arm movement.

As will be appreciated by one skilled in the art, aspects of the present invention may be embodied as a system, method or computer program product. Accordingly, aspects of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit,” “module” or “system.” Furthermore, aspects of the present invention may take the form of a computer program product embodied in one or more computer-readable media having computer readable program code/instructions embodied thereon.

Any combination of computer-readable media may be utilized. Computer-readable media may be a computer-readable signal medium or a computer-readable storage medium. A computer-readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of a computer-readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer-readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.

A computer-readable signal medium may include a propagated data signal with computer-readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer-readable signal medium may be any computer-readable medium that is not a computer-readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.

Program code embodied on a computer-readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.

Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java®, Smalltalk, C++ or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The program code may execute entirely on a user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).

Aspects of the present invention are described below with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.

These computer program instructions may also be stored in a computer-readable medium that can direct a computer, other programmable data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer-readable medium produce an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.

The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer-implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.

The present invention will now be described in detail with reference to the figures. FIG. 1 illustrates a hard disk drive environment, generally designated 100, according to one embodiment of the present invention.

Hard disk drive environment 100 includes origin hard disk drive 102, target hard disk drive 118, storage controller 104, and computer 114. Origin hard disk drive 102 and target hard disk drive 118 can be, for example, Advanced Technology Attachment (ATA), Serial ATA (SATA), Small Computer System Interface (SCSI), or Serial Attached SCSI (SAS) compatible hard disk drives. Origin hard disk drive 102 and target hard disk drive 118 can be connected internally (i.e., an internal hard disk drive) or externally (i.e., an external hard disk drive) to computer 114. Computer 114 may be, for example, a desktop computer, a laptop computer, a tablet computer, a mobile computer device, or any other computer system known in the art.

Origin hard disk drive 102 includes disk buffer 106 and origin volume 110 connected to storage controller 104 which contains control logic 108. Also connected to storage controller 104 is target hard disk drive 118 which includes target volume 112. Storage controller 104 manages the transfer of data between origin hard disk drive 102, target hard disk drive 118 and computer 114. Storage controller 104 may be, for example, a disk array controller or any storage virtualization device (e.g., SAN Volume Controller). Control logic 108 provides control for performing an accelerated background replication process in accordance with embodiments of the present invention. Control logic 108 can be implemented, for example, with one or more electrical circuits (e.g., a plurality of hardware logic gates), firmware, and combinations of both.

A “background replication process,” as used in this specification, refers generally to the process of replicating origin volume 110 to target volume 112, while also fulfilling application I/O requests made by application 116. Origin volume 110 and target volume 112 can each consist of one or more sectors located on one or more tracks of origin hard disk drive 102 and target hard disk drive 118, respectively.

Disk buffer 106 is an embedded memory that stores data that is read from the platters (e.g., fulfilling an application READ request), as well as data that is to be written on the platters (e.g., fulfilling an application WRITE request). Data that is stored in disk buffer 106 can be accessed without seeking and seek delays.

It should be understood that, for illustrative purposes, FIG. 1 does not include other elements which may be present when implementing embodiments of the present invention, such as additional logic and components on storage controller 104 to perform additional processes. Additionally, many storage systems contain arrays of hard disk drives, and logical volumes may actually span a plurality of such disk drives. For example, origin volume 110 may, in one embodiment, actually span both origin hard disk drive 102 and a second hard disk drive.

FIG. 2 is a flowchart illustrating operational steps of control logic 108 for performing an accelerated background replication process in accordance with an embodiment of the present invention. In this exemplary embodiment, origin volume 110 located on origin hard disk drive 102 is being replicated to target volume 112 located on target hard disk drive 118 as a background process to fulfilling a READ request from application 116. For illustrative purposes, the READ request of this exemplary embodiment pertains to a single sector.

Control logic 108 determines that an accelerated background replication process should be performed (step 202). In this exemplary embodiment, control logic 108 determines that the accelerated background replication process should be performed by determining that the rate of data replication has fallen below a specified threshold. The rate of data replication may be calculated as the amount of data (e.g., in megabytes) replicated from origin volume 110 to target volume 112 over the total elapsed time of the replication process (e.g., in seconds). Responsive to the rate of data replication being equal to or above the specified threshold, a sequential background replication process is instead performed by sequentially reading sectors from origin volume 110 and copying those sectors to target volume 112. In another embodiment, responsive to the rate of data sent to fulfill a READ request from application 116 being less than a specified threshold, a sequential background replication process is instead performed by sequentially reading sectors from origin volume 110 and copying those sectors to target volume 112.

Control logic 108 receives a READ request from application 116 for data located in a sector (“requested sector”) of origin hard disk drive 102 (step 204). Origin hard disk drive 102 is aware of disk geometry information such as sector and track information through the use of small computer system interface (SCSI) commands. In an embodiment where data is stored on an array of hard disk drives, such as in a RAID configuration, a RAID controller can provide the disk geometry through SCSI commands. Based on the geometry of origin hard disk drive 102, control logic 108 instructs origin hard disk drive 102 to perform a READ by moving (i.e., seeking) the appropriate magnetic head to the track on which the requested sector is located, reading the requested sector, as well as sectors that precede and are subsequent to the requested sector, and storing those sectors in disk buffer 106 (step 206). The number of preceding and subsequent sectors that are read and stored in disk buffer 106 depends on the configuration of origin hard disk drive 102. Further, since platters of origin hard disk drive 102 rotate at constant speed and outer tracks typically contain more sectors than inner tracks, the number of preceding and subsequent sectors that are read and stored in disk buffer 106 can depend on the track on which the requested data is located.

Control logic 108 receives the requested sector from origin hard disk drive 102 (step 208). After receiving the requested sector, control logic 108 sends the requested sector to application 116 to fulfill the READ request (step 210). Control logic 108 then determines whether any of the preceding, requested, and subsequent sectors that were read and stored in disk buffer 106 in step 206 should be replicated for the background replication process (step 212). Due to the single rotation direction of the disk platter, preceding sectors represents the sectors located before the requested sector on the same track and subsequent sectors represents the sectors located after the requested sector on the same track. In this exemplary embodiment, control logic 108 determines whether any of these sectors are located in origin volume 110 and have not yet been copied to target volume 112.

Responsive to determining that one or more of the preceding, requested, or subsequent sectors should be replicated (yes branch, step 212), control logic 108 reads the sectors in need of replication from disk buffer 106 (step 214) and sends the sectors to target hard disk drive 118 to be written to target volume 112 (step 216).

Accordingly, in this exemplary embodiment, disk buffer 106 is utilized to reduce the seek delay that is experienced when a request for data coincides with data that needs to be replicated during a background replication process. Sectors of data that surround a requested sector, and may need to be replicated to target volume 112, are read and stored in disk buffer 106 during a read of the requested data sector, thereby avoiding additional seek delays. Further, when writing the sectors of data to target volume 112, the sectors of data are read from disk buffer 106, which again avoids additional seek delays. Eliminating additional seek delays helps increase performance of the background replication process and the fulfillment of application I/O requests.

FIG. 3 is a flowchart illustrating operational steps of control logic 108 for accelerating a background replication process in accordance with another embodiment of the present invention. In this exemplary embodiment, origin volume 110 located on origin hard disk drive 102 is being replicated to target volume 112 located on target hard disk drive 118 as a background process to fulfilling a READ request from application 116.

Control logic 108 determines that an accelerated background replication process should be performed (step 302). In this exemplary embodiment, control logic 108 determines that the rate of data replication has fallen below a specified threshold, as previously discussed with regard to FIG. 2. Storage controller 104 receives a READ request from application 116 for data located in a sector (“requested sector”) on a track of origin hard disk drive 102 (step 304). Control logic 108 converts the READ request from application 116 to include data located in all of the sectors on the track on which the requested sector is located (step 306).

Based on the known geometry of origin hard disk drive 102, control logic 108 then instructs origin hard disk drive 102 to perform a READ by moving (i.e. seeking) the appropriate magnetic head to the track on which the requested sector is located and reading all sectors on the track (step 308). Control logic 108 receives the sectors on the track from origin hard disk drive 102 (step 310) and sends the requested sector to application 116 to fulfill the READ request (step 312). Control logic 108 then determines whether any of the sectors on the track received in step 310 should be replicated for the background replication process (step 314). To do so, in this exemplary embodiment, control logic 108 determines whether any of these sectors are located in origin volume 110 and have not yet been copied to target volume 112.

Responsive to determining that one or more sectors on the track should be replicated (yes branch, step 314), control logic 108 sends the sectors to target hard disk drive 118 to be written to target volume 112 (step 316).

Accordingly, in this exemplary embodiment, control logic 108 is utilized to reduce the seek delay that is experienced when a request for data coincides with data that needs to be replicated during a background replication process. Here, sectors of data that may need to be replicated to target volume 112 are read and sent to control logic 108 by copying all data sectors on the track on which the requested data sector is located, which avoids additional seek delays and can increase background replication performance. This embodiment also has the unique ability to replicate an entire track during the course of single read. Increases in rotational latency (i.e., time delays associated with a full platter rotation to read the entire track) are typically negligible due to high rotational speeds of the platters.

In the embodiment discussed in FIG. 2, the transfer of sectors from disk buffer 106 to target volume 112 can be performed concurrently while the magnetic head seeks for a sector located on a different track than the previous track discussed in step 206. The dedicated bandwidth between origin hard disk drive 102 and target hard disk drive 118 allows for the data (i.e., sectors) to be transferred simultaneously between origin volume 110 and target volume 112 rather than the transfer of data being idle as the magnetic head seeks for sectors located on the different track. To further utilize disk buffer 106, a redirection of a READ request from origin volume 110 and target volume 112 can occur by synchronizing origin volume 110 and target volume 112. This allows for the determination step 212 to go directly to step 214 since the synchronization automatically allows for the sectors to be replicated to target volume 112.

FIG. 4 is a diagram of the surface of a platter illustrating a hypothetical scenario in which an accelerated background replication process is performed in accordance with the embodiment of FIG. 2. Platter 400 can be, for example, one of several platters within origin hard disk drive 102. Platter 400 includes ten tracks, where track 402 represents the innermost track. In this example, sectors 404, 406 and 408 on track 402 are located in origin volume 110. Sector 404 represents the sector being requested by application 116, sector 406 represents the sector preceding the requested sector and sector 408 represents the subsequent sector after the requested sector. As previously mentioned in the discussion of FIG. 2, sector 406 and sector 408 are read to ensure all of the requested data on sector 404 is obtained.

Sectors 404, 406 and 408 are read and stored in disk buffer 106. Control logic 108 receives and sends the requested sector 404 to application 116 to fulfill the read request. Control logic 108 then determines whether sectors 404, 406 and 408 should be replicated for the background replication process. Responsive to determining sectors 404, 406, and 408 should be replicated, control logic 108 reads sectors 404, 406, and 408 from disk buffer 106 and sends them to target hard disk drive 118 to be written to target volume 112 (not shown in FIG. 4).

FIG. 5 is a diagram of the surface of a platter illustrating a hypothetical scenario in which an accelerated background replication process is performed in accordance with the embodiment of FIG. 3. Platter 500 can be, for example, one of several platters within origin hard disk drive 102. Platter 500 includes ten tracks, where track 502 represents the innermost track. In this example, origin volume 110 includes all sectors on track 502.

A READ request by application 116 for data on sector 504 is made while a background replication process is being performed. Control logic 108 converts the request for data on sector 504 to include all surrounding sectors which are located on track 502 (i.e., sectors 506). Control logic 108 instructs origin hard disk drive 102 to read all sectors on track 502. Upon receiving all sectors on track 502, control logic 108 sends the requested sector 504 to application 116 to fulfill the READ request. Control logic 108 then determines if one or more of the sectors on track 502 should be replicated for the background replication process. Responsive to determining that one or more of the sectors should be replicated, control logic 108 sends the sectors to target hard disk drive 118 to be written to target volume 112 (not shown in FIG. 5).

While embodiments of the present invention are discussed with respect to hard disk drives, a person of ordinary skill in the art will recognize that other embodiments of the present invention may be applied to any storage device capable of seeking information on separate tracks. Any storage device with a disk buffer can eliminate the seek delays experienced by having to seek to the same track twice to fulfill a background replication process and an application I/O request.

The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems and methods according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions. 

What is claimed is:
 1. A method for replicating data from a first volume to a second volume, the method comprising: receiving a first data request comprising a request for a first portion of data, wherein the first portion is part of a first volume; reading the first portion of data, and reading at least a second portion of data, in addition to the first portion of data requested in the first data request; determining whether the second portion of data should be replicated to a second volume; and in response to determining that the second portion of data should be replicated to the second volume, writing the second portion of data to the second volume.
 2. The method of claim 1, wherein writing the second portion of data to the second volume further comprises: reading the second portion of data from a disk buffer of the storage device.
 3. The method of claim 1, wherein the second volume is stored on a plurality of hard disk drives in at least one remote storage system, and the first data request comprises an application data request.
 4. The method of claim 1, wherein the first volume is stored on a plurality of hard disk drives in an array of hard disk drives.
 5. The method of claim 1, wherein, subsequent to reading the first and at least the second portions of data, the method further comprises: seeking a third portion of data containing data requested by an application, wherein the seeking occurs at least partially in parallel with said writing the second portion of data to the second volume.
 6. The method of claim 1, wherein the method for replicating data is a background replication process, and the method further comprises: analyzing a rate of data transferred between the first and second volume during a replication of data from the first volume to the second volume; and in response to determining that the rate of data transferred is above a specified threshold, sequentially replicating additional portions from the first volume to the second volume.
 7. The method of claim 1, wherein the method for replicating is a background replication process, and the method further comprises: analyzing a rate of data transferred between the first volume and an application during fulfillment of the first data request, wherein the first data request is sent to the application; and in response to determining that the rate of data transferred is below a specified threshold, sequentially replicating additional portions of data from the first volume to the second volume.
 8. The method of claim 1, further comprising: converting the first data request to include all available portions of the first volume via one or more SCSI (small computer system interface) commands.
 9. A method for accelerating a background replication process on storage volumes during application I/O (input/output) requests, the method comprising: receiving a data request from an application; reading the requested data from a first storage volume; storing the requested data in an embedded memory device, and providing the requested data to the application; receiving a read request from the background replication process; responding to the read request from the background replication process by providing data from the embedded memory device to the requesting background replication process concurrently with providing data to the requesting application; and storing, by the background replication process, the data provided from the embedded memory device onto a second storage volume.
 10. The method of claim 9, wherein reading the requested data from a first storage volume further comprises: reading at least a first portion of data and a second portion of data from the first storage volume.
 11. The method of claim 10, wherein providing data from the embedded memory device to the requesting background replication process includes providing at least the first portion of data and the second portion of data from the first storage volume.
 12. The method of claim 9, wherein data from the first storage volume is stored on a plurality of hard disk drives in an array of hard disk drives.
 13. The method of claim 11, wherein, subsequent to reading at least a first portion and a second portion of the first storage volume, the method further comprises; seeking a third portion containing data requested by the application, wherein the seeking occurs at least partially in parallel with said storing, by the background replication process, the data provided from the embedded memory device onto a second storage volume.
 14. The method of claim 13, wherein the method of accelerating a background replication process on storage volumes during application I/O requests further comprises: analyzing a rate of data transferred between the first and second storage volumes during a replication of data from the first storage volume to the second storage volume; and in response to determining that the rate of data transferred is above a specified threshold, sequentially replicating additional portions from the first storage volume to the second storage volume.
 15. The method of claim 13, wherein the method of accelerating a background replication process on storage volumes during application I/O requests further comprises: analyzing a rate of data transferred between the first storage volume and an application during fulfillment of the data request, wherein the data request is sent to the application; and in response to determining that the rate of data transferred is below a specified threshold, sequentially replicating additional portions from the first storage volume to the second storage volume.
 16. The method of claim 9, wherein the embedded memory device is a type of cache. 